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Abstract  t -  / 

propose  a  new  approach  to  tlye  integration  of  functional  and  logic  la/iguages,  based 
on  a  theory  of  unification  and  set-vAlued  functions.  A  set-valued  function  maps  a  tuple 
of  input  sets  into  an  output  set.  ^fe-describes  a  language  called  Setlog  which  illustrates 
this  approach,  and  givedts  model- theoretic,  fixed-point,  and  operational  semantics.  The 
model-theoretic  semantics  and  fixed-point  semantics  resemble  that  of  Horn  logic.  The 


operational  semantics  uses  outermost  reduction  (for  set-valued  functions)  and  unification 
(for  terms).  'tV^-«t&bKsh  the  correctness  of  the  operational  semantics  through  soundness 
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and  completeness  proofs. 
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1.  Introduction 

The  integration  of  functional  and  logic  programming  remains  a  challenging  problem, 
despite  the  many  proposals  that  have  appeared  since  Robinson’s  LOGLISP  in  the  late  sev¬ 
enties  [BBLM84,  DP85,  DFP86,  F84,  GM84,  HHT82,  JL87,  JS86,  JSG86,  L85,  MMW84, 
R84,  R85,  RS82,  SP85,  T84,  YS86].  The  challenge  lies  in  designing  a  language  that  of¬ 
fers  semantic  simplicity,  computational  efficiency,  programming  power  and  convenience. 
By  semantic  simplicity,  we  mean  that  the  language  is  amenable  to  a  logical  treatment, 
i.e.,  model  theory  and  proof  theory;  by  computational  efficiency,  we  mean  the  language  is 
amenable  to  a  straightforward  procedural  interpretation;  by  programming  power,  we  mean 
the  language  provides  at  least  the  capabilities  of  first-order  functional  and  Horn-logic  lan¬ 
guages;  and,  by  programming  convenience,  we  mean  that  programs  model  the  problem 
domain  as  closely  as  possible. 

We  offer  an  approach  based  on  a  theory  closely  related  to  predicate  logic — set-theory. 
We  use  definite  clauses  to  define  set-valued  functions.  Functional  expresssions  denote 
(possibly  infinite)  sets  of  terms.  Computation  involves  proving  set-membership  assertions, 
i.e.  solving  for  variables  which  make  the  assertions  true.  The  operational  procedure  uses 
concepts  central  to  functional  and  logic  programming:  outermost  reduction  and  unification 
of  terms. 

The  language  within  which  we  develop  these  concepts  is  called  Setlog.  We  provide 
both  a  model-theoretic  semantics  and  an  equivalent  fixed-point  semantics  for  Setlog  pro¬ 
grams,  along  the  lines  of  van  Emden  and  Kowalski  [VK76].  We  also  provide  an  operational 
semantics,  based  on  outermost  reduction  of  functions  and  unification  of  terms,  and  prove 
its  correctness  with  soundness  and  completeness  proofs.  Our  approach  clearly  separates 
function  symbols  from  constructors.  Unification  is  restricted  to  first-order  terms;  function 
application  uses  one-way  substitution.  Setlog  can  consequently  be  extended  to  include 
higher-order  functions  without  requiring  higher-order  unification,  which  is  known  to  be 
undecidable  [H71]. 

A  Setlog  program  is  basically  a  collection  of  definite  clauses  involving  set-membership 
assertions,  and  may  take  one  of  two  forms: 

1.  term  E  /(pi,...,pn) 

2.  term  E  f(pi, . . . ,  pn)  term  E  set- expression,  . . . ,  term  E  set-expression 
Setlog  terms,  as  in  logic,  are  made  up  of  constructors,  atoms,  and  logical  variables.  The 
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arguments  pi, . . .  ,pn  in  the  above  rules  are  formal  parameter  variables,  and  they  must  be 
mutually  distinct.  Because  Setlog  distinguishes  function  symbols  from  constructors,  the 
formal  parameters  of  a  function  are  distinguished  from  the  logical  variables  that  appear 
in  terms;  formal  parameters  denote  sets  of  terms,  whereas  logical  variables  denote  single 
terms.  Set  expressions  are  built  up  of  composition  of  set-valued  functions,  and  the  primitive 
set  constructor  is  {  },  which  must  contain  a  single  term.  The  symbol  «—  should  be  read 
as  “if”;  that  is,  the  membership  asserted  on  the  left  of  «—  is  true  if  all  the  memberships 
asserted  on  the  right  of  «—  are  true.  The  top-level  goal  is  of  the  form 

?-  term  G  set-expression. 

The  goal  may  also  be  a  conjunction  of  such  expressions.  The  overall  computation  of  a  Setlog 
program  proceeds  similar  to  Horn-clause  resolution,  except  that  unification  is  restricted  to 
terms,  with  outermost  reduction  used  to  reduce  set-functions. 

1.1  Related  Work 

Several  declarative  semantic  bases  integrating  functional  and  logic  languages  have 
been  proposed:  many-sorted  Horn-clause  logic  with  equality  [GM84],  equational  logic  [F84, 

GM84,  DP85,  YS86],  and  others  [MMW84,  T84).  Various  operational  strategies  have  also 
been  proposed:  narrowing  [GM84,  DP85,  R85,  YS86],  clausal  superposition  [F84],  etc. 

[JS86,  MMW84].  Setlog  differs  from  these  various  approaches  in  that  it  is  based  on  sets, 
rather  than  equations  or  predicates.  Because  Setlog  does  not  rely  on  equations,  it  avoids 
a  computationally  expensive  unification  relative  to  an  equational  theory;  instead  it  uses  a 
simple,  syntactic  unification  algorithm. 

In  a  previous  paper  [JS86],  we  described  a  language  called  EqL,  which  is  closely 
related  to  Setlog.  EqL  is  a  non-deterministic  equational  programming  language  combining 
functional  and  logic  programming.  A  key  idea  distinguishing  EqL  from  other  equational 
languages  is  its  separation  of  constructors  from  function  symbols.  A  program  in  EqL  has 
rules  of  the  form 

f[Pi » •  •  •  >Pn)  =  expression  where  subgoal-equations.  ?Qr _ 

where  each  p,  is  a  formal  parameter.  The  top-level  goal  is  a  set  of  equations.  An  equation  p] 

is  of  the  form  1  ^3 

OIL  ■  .  - 

expression  x  =  expression 3 

where  an  expression  could  have  both  constructors  as  well  as  functions.  Because  of  non-  n/ _ 
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determinism,  functional  expressions  denoted  sets  of  terms,  and  therefore  the  above  = 
predicate  is  satisfied  if  the  sets  denoted  by  expression^  and  expression^  have  non-empty 
intersection.  This  predicate  represents  true  equality  only  when  relating  two  deterministic 
expressions  (which  denote  singleton  term  sets.).  As  we  investigated  the  semantics  of  EqL, 
we  realized  that  the  equational-style  syntax  was  misleading,  as  our  equality  was  really 
set-intersection.  Improvements  to  the  syntax  led  to  better  understanding  of  the  semantic 
theory,  resulting  in  Setlog. 

The  need  to  deal  with  sets  in  functional  and  logic  programming  has  been  recognized 
by  several  researchers  [T81,  R84,  N85,  DFP86,  JS86].  Darlington  proposes  an  absolute  set 
abstraction  construct  for  collecting  together  the  solutions  of  a  set  of  equations  [DFP86]; 
this  extends  the  relative  set  abstraction  construct  of  Turner  [T81].  Naish  [N85]  surveys 
a  variety  of  all-solutions  predicates  proposed  for  Prolog.  Many  of  these  approaches  lack 
a  simple  declarative  semantics.  Plaisted  and  Jayaraman  show  one  way  to  obtain  sets 
for  functional  programming  without  sacrificing  their  declarative  semantics  [JP87];  their 
language  does  not,  however,  support  logic  programming. 

The  rest  of  this  paper  is  sectioned  as  follows:  section  2  presents  the  syntax  of  Setlog 
programs,  accompanied  by  examples  and  its  relation  to  functional  and  logic  languages; 
sections  3  and  4  give  the  model-theoretic  and  fixed-point  semantics  for  Setlog;  section 
5  is  devoted  to  the  operational  semantics  and  ideas  for  interesting  extensions;  finally, 
section  6  presents  conclusions  and  related  work.  Correctness  proofs  relating  the  formal 
and  operational  semantics  may  be  found  in  the  appendix. 

2.  Setlog 

2.1  Syntax  and  Informal  Semantics 

The  syntax  of  Setlog  programs  is  given  by  the  grammar  below.  Note  that  the  symbols 
— ►,  |,  and  c  are  meta-symbols,  and  do  not  belong  to  Setlog.  The  symbols  ,  G,  {,  },  (, 
),  ,,  and  ?-  are  language-defined  tokens.  The  symbols  setfunctor,  constructor,  atom ,  and 
logical-variable  represent  user-defined  tokens. 

program  — ►  clauses  goal 
clauses  — *  e  |  clause  clauses 
goal  — ►  ?-  body 
clause  — ►  unit  \  conditional 


conditional  — ►  head  <—  body 


head  — ►  term  6  setfunctorO  |  term  €  setfunctoripi,  ...,  pn) 

body  — ►  term  6  setexpr  j  term  €  setexpr,  body 

setexpr  — ►  {  term  }  |  setfunctorO  |  setfuncior  (setexpr s ) 

setexpr 8  — ♦  setexpr  |  setexpr,  setexpr s 

term  — ►  atom  |  logical-variable  |  constructor(terms) 

terms  — ►  term  \  term,  terms 

Note  that  we  distinguish  between  a  setfunetor  and  a  constructor.  The  former  is  used 
for  the  name  of  a  set-valued  function,  the  latter  for  a  data  constructor.  A  ground  term  is 
a  term  without  any  variables;  similarly  a  ground  set  expression  is  a  set  expression  without 
any  logical  variables.  In  the  rule  for  unit,  Pi,  -  ■■  ,pn  represent  formal  parameters.  Our 
convention  is  that  logical-variables  must  begin  with  an  uppercase  letter,  formal  parameters 
begin  with  a  lowercase  letter,  and  non-numeric  atoms  are  quoted  (as  in  LISP). 

We  use  the  constructor  cons  for  constructing  binary  trees,  as  in  LISP.  Lists  are  a 
special  form  of  binary  trees,  and  we  write  them  using  the  [...]  notation,  e.g.  [1.2.3]. 
[]  stands  for  the  empty  list,  and  is  regarded  as  an  atom.  Thus  the  list  [1.  2.  3]  is 
represented  as  cons(l,  cons (2.  cons (3.  []))).  We  use  the  notation  [H  |  T],  as  in 
Prolog  [CM81],  to  refer  to  a  non-empty  list,  with  head  H  and  tail  T.  Thus,  [H  |  T]  = 
cons(H,  T). 

We  also  permit  the  notation  {...}  to  define  a  set  literal,  e.g.  {1,2,3}.  This  is  just 
syntactic  sugar  for  a  set-function,  say  sl23(),  defined  as  follows: 

1  €  sl23() 

2  €  sl23() 

3  e  sl23() 

A  Setlog  program  consists  of  two  kinds  of  definite  clauses:  unit  and  conditional.  Both 
unit  and  conditional  clauses  assert  set  membership.  Examples  of  well-formed  unit  clauses 
are: 


1  €  sl23() 

[HjT]  6  fCsetl,  set2) 


Examples  of  well-formed  conditional  clauses  are: 


X  €  f  (si)  -  [X  |  T]  €  g(h({25}.  {X})) 

[X  |  T]  €  f  (si ,  s2)  f-  X6  g(sl),  T  6  h(s2) 

A  clause  may  introduce  logical  variables  in  any  singleton-sets  used  in  its  body,  e.g.  {X} 
in  the  first  example  above.  A  conditional  clause  states  that,  for  any  substitution  of  set 
expressions  for  its  formal  parameters  and  any  substitution  of  terms  for  its  logical  variables, 
if  all  the  conditions  in  its  body  are  true,  then  the  condition  in  its  head  is  also  true. 

The  procedural  interpretation  of  Setlog  clauses  is  similar  to  that  for  Horn  clauses, 
except  that  both  unification  and  reduction  are  used  in  simplifying  a  goal.  In  order  to 
reduce  a  goal 

iermi  €  f  (expi, . . . ,  expn) 

using  a  clause 

termi  €  f  (pi, . . . , pn)  ♦-  body 

we  unify  termi  and  term 2,  and  if  the  unification  is  successful,  we  replace  all  occurrences  of 
the  formal  parameters  pi, . . . , pn  throughout  the  body  of  f  by  expi, . . . ,  expn  respectively, 
and  then  reduce  the  goals  in  the  body  similarly.  Unification  can  bind  logical  variables  in 
both  the  goal  and  in  the  clause.  A  top-level  goal  is  solved  if  all  intermediate  goals  are 
solved  in  this  manner. 

2.2  Examples 
Set  Union 

X€union(a,t)  «—  X  €  s 
X€union(s,t)  «—  X  €  t 

?- X  €  union({l,2},  {2,3}) 

A  solution  would  bind  X  to  one  of  elements  from  the  set  {l ,  2.  3}.  If  the  implemen¬ 
tation  is  sequential  with  backtracking,  X  would  be  in  turn  bound  to  1,  2,  2,  and  3. 

There  is  no  need  in  Setlog  to  define  special  cases  for  when  one  of  the  arguments  to 
union  is  the  empty  set — there  is  no  explicit  representation  of  the  empty  set  in  Setlog.  If 
the  program  clauses  do  not  imply  any  terms  to  be  members  of  a  set  expression’s  denoted 
set,  then  that  set  expression  denotes  the  empty  set. 
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Set  Intersection 

X  €  intersect(s,  t)  «—  X  G  b,  XGt 

?- X  G  intersect (union({l ,2} ,  {3}),  {2,3.4}) 

In  finding  solutions  to  the  goal,  X  would  be  bound  in  turn  to  the  elements  2  and  3. 
Cross  Product 

[X  |  Y]  G  prod(s.  t)  «-  X  G  a.  Y  G  t 
?-  answer  G  prod({l,2},  {3.4}) 

In  computing  successive  solutions  to  the  goal,  answer  would  in  turn  be  bound  to  each 
term  from  the  set  {[1|3] ,  [1|4],  [2|3] ,  [2|4]}  . 

2.3  Syntactic  Sugars 

We  introduce  two  syntactic  sugars  to  make  programs  more  readable:  First,  we  let  a 
function  definition, 

term  E  f(.„,  term,,...)  «—  condition 
be  used  as  syntactic  sugar  for 

term\  G  f  (...,  p„ ...)  «—  termi  G  p,-,  condition 
where  p,  is  the  ith  formal  parameter.  Using  this  sugar,  the  cross  product  example  becomes 
[X|Y]  G  prod(X.  Y) 

For  the  other  syntactic  sugar,  whenever  a  term  t  is  used  in  the  goal  clause  or  in  a 
program  clause  body  where  a  set  expression  is  expected,  it  is  treated  as  {*},  the  singleton 
set  containing  t.  For  example, 

X  G  f  (A,  B) 

is  syntactic  sugar  for 

X  G  f({A}.  {B}) 

where  f  is  a  set-valued  function,  and  A  and  B  are  logical  variables. 

2.4  Expressive  Power  of  Setlog 

To  show  the  flexibility  of  the  Setlog  paradigm,  we  translate  typical  functional  and 
logic  programs  into  Setlog.  Below  are  equational  definitions  for  the  familiar  LISP  functions 
append  and  reverse. 


append{[  ],  y)  =  y 

append([h  |  t],  y)  =  eons[ht  append (t,  y)) 
reverse{\  ])  =  }  ] 

reverse([h  |  t])  =  appcnd(rcvcrsc(t) ,  [/*.]) 

These  functions  can  be  translated  mechanically  into  sugared  Setlog  functions: 

Y  €  append (  []  .  Y) 

[H|Z]  G  append([H|T] .  Y)  <-  Z  6  append(T,  Y) 

[]  6  reverse  ([]) 

Z  G  reverse  ([H|T])  <—  Z  G  append(reverseCT) ,  [H] ) 

Expanding  the  syntactic  sugars,  the  Setlog  definitions  become: 

Y  G  append(sl,  s2)  ♦-  []  G  si,  Y  G  s2 

[H|Z]  G  append(sl,  s2)  •«-  [H|T]  G  si,  Z  G  append({T},  s2) 

[]  G  reverse  (s)  ♦-  □  G  s 

Z  G  reverse(s)  *—  [H|T]  G  s,  Z  G  append ( re verse(  {T}  ),  {[H]}  ) 

To  see  that  the  translations  are  correct,  note  that  when  input  sets  are  singleton  (and 
ignoring  the  difference  between  a  term  and  the  singleton  set  containing  that  term) ,  then 

term  G  expression 

is  equivalent  to 

term  =  expression. 

When  append  is  applied  to  non-singleton  sets  si  and  s2,  the  resulting  denoted  set  contains 
all  terms  which  can  be  constructed  by  appending  a  term  in  s2  at  the  end  of  a  term  from 
si.  Similarly,  reverse  applied  to  a  set  of  lists  denotes  the  set  containing  every  list  which 
can  be  constructed  by  reversing  a  list  in  the  input  set. 

By  translating  to  Setlog,  we  gain  the  ability  to  rim  functions  backward,  as  in  Prolog. 
Not  only  can  we  solve  goals  of  the  form 

?- answer  G  appendC [1 .2] ,  [3,4]) 

corresponding  to  the  functional  goal 

?-  oppend([l,  2],  [3,4]) 

but  we  can  also  solve  goals  of  the  form 
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?-  [1.2. 3, 4]  6  reverse(X) 
which  corresponds  to  solving  the  equation 
?-  reverse(X)  =  (1,2, 3,4] 

The  ability  to  run  functions  backward  integrates  functional  and  logic  programming. 

Setlog  retains  the  full  power  of  Horn  logic  programming.  Any  Horn  logic  program 
can  be  mechanically  translated  into  a  Setlog  program.  Consider  the  following  program, 
written  in  DEC-10  Prolog  [CM81]: 
apd([  ],  Y,  Y). 

apd([H  |  T],  Y,[H  |  Z\)  :  -  apd{T,  Y,  Z). 
re v([  ],[]). 

rev{[H  |  T\,Y)  :  -  rev(T,Z),apd(Z,[H},Y). 
l-rev{X,{  1,2, 3, 4]). 

The  converted  sugared  Setlog  program  would  be: 

’true  €  apd([],  Y,  Y) 

’true  e  apd(  [H|T] .  Y.  [H|Z])  <-  ’true  G  apd(T.  Y.  Z) 

’true  G  rev(  []  .  []) 

’true  G  rev( [H|T]  .  Y)  4-  'true  G  rev(T.  Z)  .  ’true  6  apd(Z.  [H] .  Y) 
?-  ’true  G  rev(X.  [1.2. 3, 4]) 

In  the  next  two  sections,  we  develop  the  formal  semantics  of  Setlog.  Our  development 
closely  parallels  that  of  Van  Emden  and  Kowalski  [VK76]  and  Lloyd  [L84].  The  key 
difference  is  that  a  Horn  logic  program  defines  predicates  operating  on  terms,  whereas  a 
Setlog  program  defines  functions  mapping  tuples  of  term  sets  into  new  term  sets.  Setlog 
promotes  a  different  point  of  view,  more  so  than  a  new  mathematical  theory. 

3.  Model-theoretic  Semantics 

A  set  function  maps  a  tuple  of  ground  term-sets,  denoted  by  its  argument  set  expres¬ 
sions,  to  a  new  ground  term-set.  An  interpretation  maps  a  ground  set-expression  to  a  set 
of  ground  terms,  thereby  giving  meaning  to  the  set  functions.  An  interpretation  can  be 
expressed  as  a  set  of  statements  of  the  form: 

•fafi  6  gse i,  gti  e  gse2, . . .} 

where  gt{  is  a  ground  term  and  gsei  is  a  ground  set-expression.  Also  included  are  all 
statements  of  the  form: 


9 


where  gt  is  a  ground  term  —  the  tautology  that  any  given  ground  term  is  a  member  of 
the  singleton-set  containing  that  ground  term. 

A  program  model  is  an  interpretation  under  which  all  the  program  clauses  are  true. 
For  any  program  P  there  must  exist  at  least  one  model,  Bp,  the  model  containing 

gt  €  gse 

for  every  possible  combination  of  ground  term  gt  and  ground  set  expression  gse.  The 
intersection  of  all  models  of  a  program  P  is  therefore  also  a  model.  We  call  this  the  least 
model,  Mp. 

Theorem  1:  Let  P  be  a  Setlog  program.  Then 

Mp  =  {gt  €  gse  of  Bp  \  gt  €  gse  is  a  logical  consequence  of  program  P}. 

Proof: 

gt  G  gse  is  a  logical  consequence  of  P 
iff  P  and  gt  £  gse  is  logically  unsatisfiable 
iff  P  has  no  model  consistent  without  gt  €  gse 
iff  gt  €  gse  is  included  in  all  models  of  P 
iff  gt  G  gse  is  in  Mp. 

End  of  Proof. 

The  model-theoretic  semantics  of  a  Setlog  program  P  is  given  by  Mp. 

4.  Fixed  Point  Semantics 

We  define  2Bp  to  be  the  set  of  all  interpretations  of  a  Setlog  program  P.  These 
interpretations  form  a  complete  lattice  under  the  partial  order  set  inclusion,  C.  The  top 
element  is  Bp,  and  the  bottom  element,  X,  is  the  set  containing  all  statements  of  the  form 

gt  e  {gt} 

where  gt  is  a  ground  term,  but  containing  no  other  statements  involving  set-functions.  The 
lub  (least  upper  bound)  of  any  set  of  interpretations  is  the  union  of  those  interpretations. 
The  gib  (greatest  lower  bound)  of  any  set  of  interpretations  is  their  intersection. 

Let  P  be  a  program.  The  mapping  Tp :  2Bp  — ►  2Bp  is  defined  as  follows.  Let  I  be 
an  interpretation  of  P.  Then  Tp(I)  =  XU  {A  €  Bp  |  A  <—  Ai, ...,  An  is  a  ground 
instance  of  a  clause  in  P  and  {Ai, ...,  An}  C  /}. 


We  say  that  /  is  a  model  for  P 

iff  for  each  ground  instance  A  «—  A\, ...,  An  of  each  clause  in  P  we  have  {A\, ...,  An}  C  / 
implies  A  €  J 

iff  TP(I)  C  I. 

Since  2Bp  is  a  complete  lattice,  and  Tp  is  a  continuous  mapping,  the  fixed-point  semantics 
of  P,  lfp(T»  =  TP°°  (X). 

The  following  theorem  establishes  the  equivalence  of  the  fixed-point  and  model-theoretic 
semantics. 

Theorem  2:  Mp  =  Tp°°  (X). 

Proof: 

MP 

=  glb({J  |  7  is  a  model  for  P} 

=  glb({/  |  TP(I)  C  /}) 

=  lfp(7» 

End  of  Proof. 

A  program  goal  G  is  a  set  of  statements,  possibly  containing  logical  variables.  An 
answer  substitution  for  P  U  {G}  replaces  the  logical  variables  in  the  goal  with  ground 
terms.  Let  P  be  a  program,  G  a  goal,  and  $  an  answer  substitution.  6  is  a  correct  answer 
substitution  for  goal  G  under  program  P  iff  GO  is  a  logical  consequence  of  P.  Thus,  0  is  a 
correct  answer  substitution  for  G  under  P  iff  GO  is  true  with  respect  to  every  model  of  P 
iff  GO  is  true  with  respect  to  the  least  model  of  G. 

5.  Operational  Semantics 

A  goal  Gi  is  a  list  of  statements  Si,  ...,  S*,  where  each  Sj  is  of  the  form 
term  £  set-expr. 

Both  the  term  and  the  set  expression  may  contain  logical  variables  to  represent  unspecified 
ground  terms. 

Let  R  be  some  computation  rule  that  selects  a  statement  Sm  from  goal  G,.  We  have 
two  cases. 

Case  1:  Deletion  Upon  Unification 


.  »*,  j!.**.  ii  i *  «  ■-!  »  ; 


i  t  t  t1*  >w  tH  fc 1  *->  «  •«>  M 


Sm  is  of  the  form  term  G  {term0}. 

Let  $i+i  =  mgu(term,  termo). 

Let  m+i  =  <f>  (the  null  substitution). 

Let  Gj+i  —  (^li  •••»  Sm—  1  j  ^m+lj  •••)  Sk)&i+  1* 

Case  2:  Outermost  Reduction 

Sm  is  of  the  form  term  G  f{aet-expr\,  ...,  set-exprn), 
and  the  program  contains  a  clause  of  the  form 
termo  G  f{pi,  ...»  pn)  condition. 

Let  &x+i  =  mgu (term,  termo). 

Let  -yi+i  =  {pi  ♦—  set-expr\,  ...,  pn  «—  set-exprn}. 

Let  G|+i  =  {Si,  ...,  Sm—i,  condition  7i+i)  Sm+ i,  ••.,  i. 

In  either  case,  we  say  that 

Gi  with  o  ^  Gi+i  with  oOi+i, 

where  o  is  the  partial  answer  substitution  associated  with  Gi.  Note  that  computation 
of  7*+ii  which  binds  the  formal  parameters  in  an  outermost  reduction  step,  is  a  simple 
one-way  substitution;  it  requires  no  unification. 

Suppose  there  is  a  derivation 
G  =  Go  with  4>  ...  Gn  with  $\  ...  8n. 

Then  we  say 

G  =  G0  with  <f>  Gn  with  $i  ...  0n 

where  <j>  is  the  empty  answer  substitution,  and  61,82, ...»  8n  are  the  substitutions  associated 
with  steps  G\,G2, . . . , Gn.  Suppose 

G  0  with  0 1  ...  9n, 

where  0  is  the  empty  goal.  We  say  that 

0  =  6i  ...0n  T  T>(G) 

is  a  most  general  computed  answer  substitution,  where  V(G)  stands  for  the  variables  in  the 
top-level  goal  G. 

Soundness  and  completeness  theorems  state  the  equivalence  of  the  operational  and 
model-theortic  semantics.  Proofs  are  given  in  an  appendix. 
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Soundness  Theorem:  If  0  is  a  most  general  computed  answer  substitution,  and  r)  is  any 
extension  such  that  Or)  binds  ground  terms  to  all  the  variables  of  V  (G),  then  Or)  1  V  (G)  is 
a  correct  answer  substituion  as  defined  by  the  fixed-point  and  model-theoretic  semantics. 

Completeness  Theorem:  For  any  correct  answer  substitution  a ,  there  exists  a  derivation 
of  a  most  general  computed  answer  substitution  0  and  extension  rj  such  that  o  =  Or). 

5.2  Example  Derivation 

We  illustrate  the  operational  semantics  with  the  following  program  for  appending  two 
lists,  taken  from  section  2.2. 

Y  G  app(sl ,  s2)  <—  []  G  si.  Y  G  s2 

[H|Z]  €  app(sl ,  s2)  «-  [H|T]  €  si.  Z  €  app({T},  s2) 

?- Ans  G  app({[1.2]},  {[3.4]}) 

The  top-level  goal  would  be  written  as: 

Ans  G  app({ [1.2]},  {[3.4]})  with  ^ 

The  reduction  steps  leading  to  a  solution  are  given  below: 

[Hi|Ti]  €  {[1.2]}.  Zx  e  app({T,}.  {[3.4]}) 
with  {Ans  <—  [Hi|Zi]} 

Zi  €  app({[2]}.  {[3.4]}) 
with  {Ans  i-  [1|ZX]  .  Hi  «-  1,  Ti  «-  [2]} 

^  [H2|T3]  G  {[2]}.  Z3  G  app({Ta}.  {[3.4]}) 

with  {Ans  «—  [1  |  [H2 |Z2 ]  ]  .  Hi  «—  1,  Ti  «—  [2],  Zi  «—  [H2 |Z2] } 

Zj  G  app({Tj},  {[3.4]}) 

with  {Ans  i-  [1  |  [2{Z2]  ] ,  Hi  «-  1.  Tt  «-  [2] .  Zx  [2|Z3]  . 

H3  -  2.  T3  -  []} 

~  □  6  {[]}.  Z3  G  {[3.4]}) 

with  {Ans  ♦-  [1  |  [2|Z3]] ,  Ht  <-  1.  Ti  <-  [2] .  Zx  [2|Z3] . 


with  {Ans  4-  [1  |  [2|Z2]].  Hi  4-  1.  h  «-  [2]  .  Zx  <-  [2|Z2]  . 

H2  *—  2,  T2  ♦—  []  ,  Y2  4—  Z2} 

,v*  <f> 

with  {Ans  4-  [1.2. 3, 4],  Hx  4-  1.  Ti  «-  [2],  Zi  <-  [2.3.4], 

H2  4-  2.  T2  4-  []  .  Y3  ♦-  [3.4],  Z2  4-  [3.4]} 

The  restriction  of  the  final  substitution  to  the  variables  in  the  top-level  goal  yields  {Ans 
4—  [1.2. 3, 4]}  as  the  computed  answer. 

5.3  Possible  Extensions  to  Higher  Order  Functions 

In  Prolog  and  other  Horn  logic  languages,  all  variables  are  bound  by  unification.  Ex¬ 
tending  these  languages  to  handle  higher-order  predicates  requires  higher-order  unification, 
a  procedure  known  to  be  undecidable  [H71],  In  contrast,  Setlog  distinguishes  between  log¬ 
ical  variables  and  formal  parameters,  and  binds  the  latter  by  one-way  substitutions.  For 
this  reason,  we  believe  Setlog  is  more  amenable  to  higher-order  extension.  For  instance,  it 
is  possible  to  generalize  LISP’s  mapcar  function  to  operate  upon  set- valued  functions  and 
sets  of  lists,  written  as  follows: 

[]  e  mapcar (f,  []) 

[A|B]  e  mapcar (f.  [H|T] )  4-  A  €  f(H),  B  G  mapcar(f.  T) 

In  this  example,  we  have  let  the  formal  parameter  t  stand  for  a  function,  instead  of  a  set 
expression. 

We  do  not  envision  the  need  to  solve  for  logical  variables  denoting  functions  or  pred¬ 
icates,  since  neither  first-order  logic  languages  nor  higher-order  functional  languages  offer 
this  capability.  We  feel  that  one  of  the  chief  advantages  of  the  Setlog  approach  is  its  poten¬ 
tial  to  deal  with  higher-order  functions  without  getting  into  the  problem  of  higher-order 
unification. 

7.  Conclusions 

We  have  shown  that  Setlog  has  semantic  and  operational  simplicity  comparable  to 
pure  Prolog  (Horn  logic),  and  has  equal  power.  Setlog  is  similar  in  that  a  program  con¬ 
sists  of  a  set  of  definite  clauses,  and  execution  searches  for  substitutions  binding  terms  to 
logical  variables  in  a  goal  clause,  making  the  goal  a  logical  consequence  of  the  program. 


Whereas  Prolog  clauses  describe  predicates  defined  on  terms,  Setlog  clauses  describe  func¬ 
tions  defined  on  sets  of  terms.  In  Prolog  there  is  no  distinction  between  logical  variables 
and  procedural  formal  parameters.  Incorporating  higher-order  objects  into  Prolog  would 
require  higher-order  unification.  In  Setlog,  the  distinction  between  logical  variables  and 
formal  parameters  is  marked.  A  logical  variable  stands  for  a  term;  a  formal  parameter 
stands  for  a  set  of  terms.  Logical  variables  are  bound  through  unification,  as  in  Pro¬ 
log.  The  substitutions  binding  formal  parameters  to  expressions  are  created  by  a  one-way 
matching,  as  in  functional  programming.  As  a  result,  incorporating  higher-order  functions 
into  Setlog  may  be  easier.  Functions  can  be  passed  as  arguments  without  requiring  uni¬ 
fication  of  functions.  We  are  at  present  investigating  a  higher-order  extension  of  Setlog 
[S87]. 

Set  theory  and  predicate  logic  are  closely  related  mathamatical  formalisms.  The 
semantics  of  predicate  logic  are  defined  in  terms  of  set  theory  (the  Herbrand  universe  is 
a  set,  as  is  a  Herbrand  interpretation  and  a  Herbrand  model).  On  the  other  hand,  the 
axioms  of  set  theory  can  be  encoded  as  a  theory  within  predicate  logic.  Thus,  set  theory 
and  the  predicate  logic  are  two  alternative  but  equivalent  theories  [Q82],  though  some 
problems  are  expressed  more  naturally  in  one  than  in  the  other.  Because  set  theory  and 
predicate  logic  are  so  closely  related,  the  semantics  of  Setlog  resemble  the  semantics  of 
Horn  clause  logic  programming  (VK76,  L84].  However,  our  semantics  defines  a  system  of 
set-valued  functions,  instead  of  general  predicates  operating  upon  terms. 

As  we  wished  to  concentrate  on  semantics,  we  have  not  addressed  implementation 
issues  in  this  paper.  However,  the  reader  may  see  that  techniques  for  sequential  Horn  logic 
implementation  [WPP77]  are  adaptable  to  the  implementation  of  Setlog.  It  is  also  possible 
to  exploit  both  and  and  or  parallelism  [CK81]  in  the  execution  of  Setlog  programs. 
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Appendix  -  Correctness  Proofs 

We  establish  here  the  correctness  of  the  operational  semantics.  We  build  up  to  the 
soundness  and  completeness  theorems  by  first  proving  a  series  of  lemmas.  These  proofs  re¬ 
semble  the  proofs  for  soundness  and  completeness  of  Horn  clause  resolution.  The  similarity 
is  to  be  expected,  since  both  Setlog  and  pure  Prolog  define  a  program  as  a  collection  of 
definite  clauses.  In  the  case  of  Setlog,  the  definite  clauses  determine  the  terms  included  in  a 
set-expression's  denoted  set;  in  Prolog  the  clauses  establish  the  truth  of  ground  predicates. 
We  begin  with  two  definitions: 

An  unrestricted  derivation  is  like  the  ^-derivations  described  earlier,  except  that  the 
unifiers  need  not  be  most  general. 

The  success  set  of  a  Setlog  program  is  the  set  of  all  ground  conditions  A  in  Bp  such 
that  A  0. 

The  empty  goal  0,  i.e.  the  goal  with  no  conditions,  is  trivially  satisfied,  and  is  thus  a 
logical  consequence  of  all  Setlog  programs.  Given  a  goal  G  (a  set  of  conditions)  containing 
logical  variables,  and  a  Setlog  program  P,  we  say  that  G  is  a  logical  consequence  of  P  if  for 
all  substitutions  q  which  bind  the  logical  variables  of  G  to  ground  terms,  Gq  is  a  logical 
consequence  of  P. 

Lemma  1.  Suppose  we  have  a  derivation 
Go  Gn  with  6 

then  for  i  =  0 , . . .  n,  G,  contains  no  formal  parameters. 

Proof:  Go  is  a  top-level  goal  clause.  By  definition,  it  cannot  contain  formal  parameters. 
To  prove  by  induction,  we  need  only  show  that  if  G,-  contains  no  formal  parameters, 
than  neither  does  G,+ 1.  We  must  consider  two  cases,  depending  on  whether  this  step  is  a 
deletion  upon  unification,  or  an  outermost  reduction.  In  the  case  of  deletion  by  unification, 
we  unify  two  terms,  term  and  termo  (neither  of  which  may  contain  formal  parameters), 


delete  the  condition,  and  apply  the  unifier  to  the  remaining  conditions  in  the  goal  clause. 
Since  neither  the  unifier  nor  the  remaining  conditions  contain  formal  parameters,  than 
neither  does  the  new  goal  clause.  In  the  case  of  an  outermost  reduction,  0,-+i  unifies  two 
terms,  term  and  termo,  both  free  of  formal  parameters,  and  thus  introduces  no  formal 
parameters  into  the  new  goal.  The  condition  of  the  program  clause  does  contain  formal 
parameters,  but  these  are  all  removed  by  the  application  of  'U+i- 

End  of  Proof. 

Lemma  2.  Given  some  goal  Go  under  some  Setlog  program,  and  a  derivation 
Go  Gn  with  0 

If  Gn  is  a  logical  consequence  of  the  program,  then  so  is  Gq8. 

Proof:  The  proof  is  by  induction  on  the  number  of  steps  in  the  derivation.  It  is  obviously 
true  with  a  derivation  of  zero  steps,  for  which  Go  =  Gn,  and  0  is  the  empty  substitution 
<f>.  Suppose  it  is  true  for  a  derivation  of  n  —  1  steps.  We  need  show  that  it  is  then  true 
for  a  derivation  of  n  steps.  Suppose  Go  consists  of  the  conditions  A\, ...,  A*,  and  that  the 
unifiers  of  the  n-step  derivation  are  0\, 8n.  Since  we  have 

G\  Gn  with  02. ..0n, 

G i02-.. 0n  is  a  logical  consequence  of  the  program,  which  is  the  same  as  G\0\...0n  (since  the 
last  step  in  forming  G\  from  Go  is  application  of  0i-  To  examine  the  truth  of  Go0i—0n, 
we  must  consider  two  cases.  If  the  first  derivation  step  was  a  deletion  upon  unification, 
then  Go8\...0n  contains  exactly  the  same  conditions  as  G\0\...0n,  but  with  an  additional 
condition  of  the  form 

term  €  {term}, 

which  is  a  logical  consequence  of  all  Setlog  programs.  Since  G\0i...0n  is  a  logical  conse¬ 
quence  of  P,  then  so  is  Go0o...0n. 

Otherwise,  the  first  derivation  step  was  an  outermost  reduction,  applied  to  a  condition  in 
Go 

term  €  f(set-ezpri,  ...,  aet-ezprn), 
using  a  program  clause  of  the  form 

termo  €  f{pu  •••»  Pn)  condition, 

with 
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8\  =  mgu  (term,  ter  mo), 

'll  =  {pi  «-  set-expri,  ...,  pn  «-  set-exprn). 

G o0i  and  G j  are  identical,  except  that  G o0i  has  a  condition  which  can  be  written  as 
( term  €  /(p  1,  ....  pn))~l  181 
where  G  i  has 

conditioned  i . 

The  program  clause  states  that  any  substitution  making  the  latter  true  also  makes  the 
former  true.  Since  G 187. ..8n  is  a  logical  consequence  of  the  program,  so  is  Go8\8i...8n. 

End  of  Proof. 

Theorem  3:  Soundness  Theorem.  If  8  is  a  computed  answer  substitution  of  goal  G 
(under  Setlog  program  P),  then  G0  is  a  logical  consequence  of  the  program. 

Proof:  There  is  a  derivation  G  0  with  6',  where  8  =  8']  V (G).  Since  the  empty 

goal  is  a  logical  consequence  of  all  programs,  and  by  Lemma  2,  G8'  —  G8  is  a  logical 
consequence  of  the  program,  0  is  a  correct  answer  substitution. 

End  of  Proof. 

Now  we  develop  the  completeness  proof. 

Lemma  4:  MGU  Lemma.  Suppose  there  is  a  goal  G  and  a  Setlog  program  P  with  an 
unrestricted  derivation: 

G  <>  with  8x...0n, 

Then  there  exists  a  restricted  derivation 

G  0  with  8,1...8'n, 

(8i,  ...,  0n  are  most  general  unifiers)  and  a  substitutions  a  such  that 
0x...8no  = 

Proof:  The  proof  is  by  induction  on  the  length  of  the  derivation.  Suppose  n  =  1,  and 
G  <>  with  6\, 

where  0i  is  not  necessarily  a  most  general  unifier.  Then  there  exists  a  most  general  unifier, 
0\,  such  that 


G  <>  with  $[, 

61  >  e\. 

and  there  exists  a  substitution  sigma  such  that 
=  9yO. 

Suppose  the  result  holds  for  n  —  1,  and  there  is  an  unrestricted  derivation 
G  =  Go  0  with  0\...0n. 

Let  be  the  most  general  unifier  of  the  terms  unified  in  the  first  derivation  step.  Then 
there  exists  a  substitution  p  such  that 

$i  =  9\p. 

Defining  G\  =  Gyp,  we  see  that 
G0  'v*  Gy  with  9[, 

Gy  Gj  with  pO 2. 

G\  'v*  0  with  P0203— On- 

By  the  induction  hypothesis,  there  exists  a  substitution  a  and  restricted  derivation 
G\  0  with  0'2...9'n. 

p02...0n  =  p0'2...0'n. 

Combining  this  derivation  with  the  first  step,  we  have  restricted  derivation 
G  =  G0  0  with0'...0'n. 

0x...0n  =  O[p92...0n  = 

End  of  Proof. 

Theorem  4.  The  success  set  of  a  Setlog  program  is  equal  to  its  least  model. 

Proof:  The  soundness  theorem  implies  that  the  success  set  is  contained  in  the  least  model. 
We  need  only  show  that  the  success  set  contains  the  least  model.  Consider  any  arbitrary 
ground  condition  A  in  Bp  ( A  is  of  the  form  term  G  setexpr).  By  Theorem  2,  there  exists 
an  integer  n  such  that 

A  €  7>n(±) 

(or  TPn  for  short).  We  prove  by  induction  on  n. 
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Suppose  n  =  0.  Then  A  €  X.  A  is  of  the  form 
ground-term  6  {ground-term}. 

We  have 

A  0  with  <f>. 

by  deletion  upon  unification  [<f>  is  the  empty  substitution).  A  is  in  the  success  set. 

Suppose  the  result  holds  for  n  —  1.  By  the  definition  of  Tp,  there  exists  a  ground  instance 
of  a  program  clause 

B  <—  Bi,  ...,  Bk 

(by  ground  instance  we  mean  substitution  set  expressions  for  the  formal  parameters  with 
some  substitution  7  and  ground  terms  for  the  logical  variables  with  some  substitution  0) 
such  that  A  =  B^B,  and 

{B17O,  ...,  Bk'iB}  C  Tp  t  (n  -  1), 

A  {B^B,  ....  J3*)70. 

with  an  unrestricted  derivation  step.  By  the  induction  hypothesis,  each  Bi'yO  is  in  the 
success  set.  Because  each  B^B  is  ground,  their  derivations  can  be  combined  into  a  single 
derivation 

(Bi,  ....  Bfc)70  'vV  O. 

Thus  A  0,  and  by  the  MGU  lemma,  there  exists  a  restricted  derivation,  as  well,  A  is 
a  member  of  the  success  set. 

End  of  Proof. 

Lemma  5.  Let  A  be  a  condition  of  the  form  term  €  setexpr,  and  a  logical  consequence 
of  a  Setlog  program  P.  Then  there  is  a  derivation 

A  0  with  a, 

a  T  V(A)  =  *■ 

The  second  formula  states  that  a  does  not  affect  any  of  the  logical  variables  in  A,  i.e. 
Act  =  A. 

Proof:  Suppose  A  has  variables  Xi,  ...,  Xn.  Let  aj,  ...,  an  be  distinct  new  constants 
(appearing  neither  in  the  program  P  nor  in  A).  Define  0  =  {ATj  <—  ax,  ...,  Xn  «—  an}. 
Then  AB  is  a  logical  consequence  of  P.  Since  AB  is  ground,  Theorem  4  states  that 


AO  0  with  o', 
a1  t  V(A6)  =  4. 

Since  the  a<  do  not  appear  in  P  nor  A,  by  textually  replacing  all  at-  by  X,  in  this  derivation 
we  obtain  the  derivation 

A  0  with  a, 

a  T  V{A)  =  4>. 

End  of  Proof. 

Theorem  5:  Completeness  Theorem.  Let  P  be  a  Setlog  program  and  G  a  goal.  For 
every  correct  answer  substitution  0  there  exists  a  derivation 

G  <>  with  a , 

and  a  substitution  rj  such  that 

0  =  [a  r  V(G))»7. 

Proof:  Let  G  =  A\,  An.  Since  0  is  correct,  then  GO  is  a  logical  consequence  of  P. 
By  Lemma  5, 

Ai  ^  with  0\y 

a  T  V(Ai)  =  <f>. 

We  can  combine  these  into  a  single  derivation 
G  0  with  a, 

<7  T  V(X.)  =  <t>. 

Suppose  that  the  sequence  of  mgu’s  in  the  derivation  is  0\,  thetam.  Then 
GO  —  GOO  —  GB0\...6rn. 

By  the  MGU  Lemma, 

G  0  with  0\...0,mt 

such  that  for  some  17',  00\...0m  =  O^.-.O^tj1  .  So,  let  a  =  O^.-.O^  and  rj  =  q'  |  !>((?). 
End  of  Proof. 


23 


C 


